Context dependent phoneme duration modeling with tree-based state tying

نویسندگان

  • Myoung-Wan Koo
  • Ho-Hyun Jeon
  • Sang-Hong Lee
چکیده

In this paper, we propose phoneme duration modeling methods with tree-based state tying. Two kinds of phone duration modeling methods are suggested. The first is context independent phoneme duration model in which duration parameters are stored in each phone. The second is context dependent duration model in which duration parameters are stored in each state being shared by context dependent phone. We split duration parameters of each context dependent phoneme into three kinds of tied states estimated by tree-based clustering. The duration of each state is modeled by Gamma distribution function. Both HMM and duration parameters are stored in states tied for expressing all context dependent phones in a phone. The duration parameters of context dependent phoneme are automatically generated from state duration parameters in the initialization stage of recognition. Context dependent phoneme duration model is compared with context independent phoneme duration model as well as with no duration model. We make experiments with database collected through telephone network service. Experimental results demonstrate that duration information rejects OOT (out-of-task) words very well and that context dependent duration model yields the best performance among three methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

متن کامل

Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition

Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...

متن کامل

English Alphabet Recognition Based on Chinese Acoustic Modeling

How to effectively recognize English letters spoken by Chinese people is our major concern in the paper. Some efforts are made to build Chinese extended Initial/Final (XIF) based HMMs for English alphabet recognition which can be integrated with large vocabulary continuous Chinese speech recognition (Chinese LVCSR) system based on a same XIF set. The alphabet-specific XIF HMMs are built using c...

متن کامل

Analysis of Duration Prediction Accuracy in HMM-Based Speech Synthesis

Appropriate phoneme durations are essential for high quality speech synthesis. In hidden Markov model-based text-tospeech (HMM-TTS), durations are typically modeled statistically using state duration probability distributions and duration prediction for unseen contexts. Use of rich context features enables synthesis without high-level linguistic knowledge. In this paper we analyze the accuracy ...

متن کامل

A Comparative Evaluation of GMM-Free State Tying Methods for ASR

Deep neural network (DNN) based speech recognizers have recently replaced Gaussian mixture (GMM) based systems as the state-of-the-art. While some of the modeling techniques developed for the GMM based framework may directly be applied to HMM/DNN systems, others may be inappropriate. One such example is the creation of context-dependent tied states, for which an efficient decision tree state ty...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEICE Transactions

دوره 88-D  شماره 

صفحات  -

تاریخ انتشار 2004